3574 results found.
Written
Corpus,
Language Type:
Multilingual
Languages:
English
Availability:
Not Applicable
License:
<Not Specified>
Size:
<Not Specified> Production Status:
Existing-used
Use:
Named Entity Recognition
-
Paper title:Biomedical entity extraction using machine-learning based approaches
-
Paper track:Evaluation
-
Paper status:Accept Poster
| Author Number | Name | Affiliation | Country |
|---|---|---|---|
| Author 1 | Cyril Grouin | LIMSI-CNRS | FR |
| Main Contact | Cyril Grouin | LIMSI-CNRS | None |
Documentation:
<Not Specified>
Written
Corpus,
Language Type:
Trilingual
Languages:
English Finnish Turkish
Availability:
Freely Available
License:
<Not Specified>
Size:
millions sentencesProduction Status:
Existing-used
Use:
Language Modelling
-
Paper title:Morfessor FlatCat: An HMM-Based Method for Unsupervised and Semi-Supervised Learning of Morphology
-
Paper track:Morphology, word segmentation, tagging and chunking
-
Paper status:Accept Poster
| Author Number | Name | Affiliation | Country |
|---|---|---|---|
| Author 1 | Stig-Arne Grönroos | Aalto University, Department of Signal Processing and Acoustics | FI |
| Author 2 | Sami Virpioja | Aalto University | FI |
| Author 3 | Peter Smit | Aalto University | None |
| Author 4 | Mikko Kurimo | Aalto University | FI |
| Main Contact | Stig-Arne Grönroos | Aalto University, Department of Signal Processing and Acoustics | None |
Documentation:
<Not Specified>
Written
Tagger/Parser,
Language Type:
Multilingual
Languages:
English
Availability:
Freely Available
License:
NON-EXCLUSIVE ACADEMIC USE LICENSE
Size:
56 MByteProduction Status:
Existing-used
Use:
Information Extraction, Information Retrieval
-
Paper title:Comparable Study of Event Extraction in Newswire and Biomedical Domains
-
Paper track:IE/database linking
-
Paper status:Accept Oral
| Author Number | Name | Affiliation | Country |
|---|---|---|---|
| Author 1 | Makoto Miwa | Toyota Technological Institute | JP |
| Author 2 | Paul Thompson | The National Centre for Text Mining, The University of Manchester | GB |
| Author 3 | Yannis Korkontzelos | National Centre for Text Mining, The University of Manchester | GB |
| Author 4 | Sophia Ananiadou | University of Manchester | GB |
| Main Contact | Makoto Miwa | Toyota Technological Institute | None |
Documentation:
<Not Specified>
Written
Evaluation Package,
Language Type:
Multilingual
Languages:
English french
Availability:
Freely Available
License:
Same Creative Commons licence under which the TED conference talks data is already available
Size:
250 pronouns; 2,093 parallel sentences; 45,351 English tokens; 48,266 French tokens OtherProduction Status:
Newly created-finished
Use:
Evaluation/Validation
-
Paper title:PROTEST: A Test Suite for Evaluating Pronouns in Machine Translation
-
Paper track:Evaluation
-
Paper status:Accept Poster+DemoSuggested
| Author Number | Name | Affiliation | Country |
|---|---|---|---|
| Author 1 | Liane Guillou | University of Edinburgh | GB |
| Author 2 | Christian Hardmeier | Uppsala universitet | SE |
| Main Contact | Liane Guillou | University of Edinburgh | None |
Documentation:
Documentation is available in English and will be included in the (publicly available) evaluation package
Written
Corpus,
Language Type:
Bilingual
Languages:
English french
Availability:
Freely Available
License:
<Not Specified>
Size:
<Not Specified> Production Status:
Existing-used
Use:
Comparability measure assement
-
Paper title:Variations on quantitative comparability measures and their evaluations on synthetic French-English comparable corpora
-
Paper track:Written
-
Paper status:Accept Poster
| Author Number | Name | Affiliation | Country | ||
|---|---|---|---|---|---|
| Author 1 | Guiyao Ke | Université de Bretagne Sud | FR | Universite de Bretagne Sud, IRISA | FR |
| Author 2 | Pierre-Francois Marteau | Universite de Bretagne Sud, IRISA | FR | ||
| Author 3 | Gildas Menier | Universite de Bretagne Sud, IRISA | FR | ||
| Main Contact | Pierre-Francois Marteau | Universite de Bretagne Sud, IRISA | None |
Documentation:
<Not Specified>
Written
Evaluation Data,
Language Type:
Multilingual
Languages:
English
Availability:
Freely Available
License:
<Not Specified>
Size:
6000 entries Production Status:
Existing-used
Use:
Question Answering
-
Paper title:High Accuracy Rule-based Question Classification using Question Syntax and Semantics
-
Paper track:Syntactic and Semantic Parsing, Grammar Induction
-
Paper status:Accept Poster
| Author Number | Name | Affiliation | Country | ||
|---|---|---|---|---|---|
| Author 1 | Harish Tayyar Madabushi | University of Birmingham | GB | ||
| Author 2 | Mark Lee | University of Birmingham | N/A | School of Computer Science, University of Birmingham, UK | None |
| Main Contact | Harish Tayyar Madabushi | University of Birmingham | None |
Documentation:
<Not Specified>
Written
Corpus,
Language Type:
Multilingual
Languages:
English
Availability:
Freely Available
License:
<Not Specified>
Size:
250k tokens Production Status:
Existing-used
Use:
Parsing and Tagging
-
Paper title:Language Independent Dependency to Constituent Tree Conversion
-
Paper track:Syntactic and Semantic Parsing, Grammar Induction
-
Paper status:Accept Poster
| Author Number | Name | Affiliation | Country |
|---|---|---|---|
| Author 1 | Young-Suk Lee | IBM Research | US |
| Author 2 | Zhiguo Wang | IBM Watson Research Center | US |
| Main Contact | Young-Suk Lee | IBM Research | None |
Documentation:
<Not Specified>Language Type:
Multilingual
Languages:
English italian
Availability:
Freely Available
License:
CreativeCommons
Size:
Word Alignment links: 20,000 OtherProduction Status:
Newly created-finished
Use:
Word Alignment
-
Paper title:WAGS: A Beautiful English-Italian Benchmark Supporting Word Alignment Evaluation on Rare Words
-
Paper track:Written
-
Paper status:Accept Oral
| Author Number | Name | Affiliation | Country |
|---|---|---|---|
| Author 1 | Luisa Bentivogli | Fondazione Bruno Kessler | IT |
| Author 2 | Mauro Cettolo | FBK | IT |
| Author 3 | M. Amin Farajian | University of Trento, FBK | IT |
| Author 4 | Marcello Federico | FBK | IT |
| Main Contact | Luisa Bentivogli | Fondazione Bruno Kessler | None |
Documentation:
<Not Specified>Language Type:
Multilingual
Languages:
English
Availability:
Freely Available
License:
<Not Specified>
Size:
<Not Specified> <Not Specified>Production Status:
Newly created-finished
Use:
Question Answering
-
Paper title:Easy Questions First? A Case Study on Curriculum Learning for Question Answering
-
Paper track:Empirical/Data-Driven
-
Paper status:Accept - Oral
| Author Number | Name | Affiliation | Country |
|---|---|---|---|
| Author 1 | Mrinmaya Sachan | Carnegie Mellon University | US |
| Author 2 | Eric Xing | <Not Specified> | US |
| Main Contact | Mrinmaya Sachan | Carnegie Mellon University | None |
Documentation:
<Not Specified>Language Type:
Multilingual
Languages:
English
Availability:
Freely Available
License:
Open Source
Size:
400 documents OtherProduction Status:
Newly created-finished
Use:
Document Classification, Text categorisation
-
Paper title:Experiments with Convolutional Neural Networks for Multi-Label Authorship Attribution
-
Paper track:Evaluation
-
Paper status:Accept Poster
| Author Number | Name | Affiliation | Country |
|---|---|---|---|
| Author 1 | Dainis Boumber | University of Houston | US |
| Author 2 | Yifan Zhang | University of Houston | US |
| Author 3 | Arjun Mukherjee | University of Houston | US |
| Main Contact | Dainis Boumber | University of Houston | None |
Documentation:
https://github.com/dainis-boumber/ AA_CNN/wiki/MLPA-400-Dataset




